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ABSTRACT. We present a new approach to the statistics of the cosmic density 
field and to the mass distribution of high-contrast structures, based on the for- 
malism of Cayley trees. Our approach includes in one random process both fluc- 
tuations and interactions of the density perturbations. We connect tree-related 
quantities, like the partition function or its generating function, to the mass dis- 
tribution. The Press & Schechter mass function and the Smoluchowski kinetic 
equation are naturally recovered as two limiting cases corresponding to indepen- 
dent Gaussian fluctuations, and to aggregation of high-contrast condensations, 
respectively. Numerical realizations of the complete random process on the tree 
yield an excess of large-mass objects relative to the Press & Schechter function. 
When interactions are fully effective, a power-law distribution with logarithmic 
slope -2 is generated. 

Subject headings: cosmology: theory - galaxies: clustering - large scale structure 
of the universe - galaxies: formation 

1. INTRODUCTION 

The mass distribution N(M) of high-contrast structures constitutes a central 
link between observed extragalactic sources and the physics of the early universe. 

In the canonical scenario such structures form by direct hierarchical collapses 
(DHCs). In the expanding Universe small overdensities above the decreasing back- 
ground density p are weakly gravitationally unstable (see Peebles 1993). The con- 
trasts 5 = IS.pl ' p grow slowly in their linear regime, with 5 oc t 2 ' 3 in a critical 
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universe. As the contrasts approach unity the perturbations detach from the Hub- 
ble expansion, turn around, and in a comparable time non linearly collapse to end 
up in high-contrast virialized structures. 

As for the "initial" conditions, the physics of the early universe (see Kolb 
& Turner 1990) suggests the perturbations started in the form of a Gaussian 
random field. Their power spectrum at z < 10 3 may be piecewise approximated 
by \5k\ 2 oc k n with —3 < n < 1. So nonlinear conditions /c 3 |5fc| 2 ~ 1 are reached 
sequentially at larger and larger sizes, with the rich clusters (Abell 1958) forming 



now. 



For the range of masses M <L 10 12 Mq the collapses are little affected by 
dissipation, and the DHC scenario gives rise yo a theory of elaborate elegance (see 
Peebles 1965; Gunn & Gott 1972; Press & Schechter 1974, hereafter PS; Rees & 
Ostriker 1977; Bond et al. 1991). The linear density field 5 is smoothed or averaged 
at each point with a filter function of effective size R corresponding to a mass 
M oc pR s . On each scale M, the variance of 5{M) yields the dispersion a oc M~ a 
(a = 1/2 + n/6) of the "initial" Gaussian distribution p(5, a). The nonlinear 
collapses are modeled after the pattern provided by the top-hat smoothing filter; 
namely, isolated spherical and homogeneous overdensities which virialize, with a 
definite characteristic mass M c (t) oc t 2 ' 3a in a critical universe, by the time when 
the actual contrasts reach values ~ 2 10 2 . As this time would correspond to a 
nominal contrast 5 C ~ 1.7 in the extrapolated linear behavior, such threshold is 
taken to separate the linear regime from bona fide condensations. 



The condensed mass fraction is provided by the fraction of spheres where the 
linear S(M) crosses the threshold S c . The mass assigned to each collapsed object 
is twice that of the largest sphere wherein such a condition applies, while the 
smaller ones are disregarded. This elaborate selection rule can be written simply 
as N(M) M dM = — 2pd f^° d5p(5, a), as originally proposed by PS, and in terms 
of m = M/M c (t) and a oc m~ a yields 

N(M,t)= P 2 °^ m-» e -</"'. (1.1) 

The single parameter 5 C should comprise the complex nonlinear dynamics 
of the collapses. Once its value is set on the basis of, e.g., the top-hat model, 
the functional form of eq. 1.1 at any given t self-similarly depends on the initial 
spectrum only; specifically, on the spectral index n and on the amplitude taken 
by a at the scale 8/i -1 Mpc singled out by unit variance in galaxy counts. Self- 
similarity is stressed in the analysis of Bond et al. 1991, who examine the linear 
field at one epoch (say, the present t ) on the resolution S oc M~ 2a oc a 2 , and 
compare the resulting S(S) with a lowering threshold 1.7 (t/t )~ 2 < 3 . 

The same authors clarify in terms of excursion sets the statistics underlying 
eq. 1.1. They show that S(S), when extracted from the Gaussian field using a sharp 
k-space filter, executes a simple (Markovian) random walk governed by a diffusion 
equation. The diffusive flux density per unit resolution of such trajectories, having 
their first up-crossing through the threshold 5 C within dM at M, yields the PS 
expression complete with shape and amplitude, in agreement with the selection 
rule. 



For all its background, the result 1.1 has a number of drawbacks. The above 
prescriptions avoid overcounting substructures and imply a uniform timescale 3t/2 
for all collapses. By the same token, however, they are likely to underplay sub- 
structures and to overestimate the normalization. In fact, computations which 
assign definite sizes and timescales to the collapses (Cavaliere, Colafrancesco & 
Scaramella 1991) yield longer permanence of substructures and a lower normaliza- 
tion for N(M,t). Equivalently, filters localized in the ordinary rather than in the 
k-space (Bond et al. 1991) change the mass assignment to the collapsing density 
peaks and yield different statistics and generally straighter shapes. Observations, 
especially in X-rays, image abundant substructures within clusters, ranging up 
to truly binary configurations (see Jones & Forman 1992; White, Briel, & Henry 
1993). 

In addition, a considerable body of evidence indicates that the direct collapse 
scenario is incomplete at the high-mass end. For example, at galactic scales the 
cD's outnumber the Schechter luminosity distribution (Schechter 1976; Bhavsar 
1989); the building up of their bodies is best understood in terms of aggregations 
of normal members in groups as described by the Smoluchowski kinetic equation 
(Cavaliere, Colafrancesco, & Menci 1992, hereafter CCM). Galaxy interactions 
and merging involving one bright partner are also likely to stimulate the emissions 
from active galactic nuclei at z ^ 2.5, as indicated by statistics, by morphologies 
of the host galaxies, and by the richness of their environments (see Heckman 
1993; Bahcall & Chokshi 1991); numerical experiments and theoretical studies 



also concur to this view (Shlosman 1990; Barnes & Hernquist 1991). At larger 
scales, the imaging X-ray observations referred to above provide many snapshots 
of groups and clusters at various stages of essentially binary aggregations. 

In fact, high-resolution N-body experiments (e.g., Brainerd & Villumsen 1992; 
Katz, Quinn, & Gelb 1993; Jain & Bertschinger 1994) show structure formation 
to be a far more complex process than envisaged by the simple DHC scenario. 
It includes, in addition to direct collapses, frequent encounters and aggregations 
within clusters and within larger scale, precursor structures with the dimension- 
ality of sheets and filaments. The resulting N(M,t) shows, relative to eq. 1.1, a 
slower evolution and a different shape due to an excess at large M. 

Here we propose a novel approach to a satisfactory N(M,t). We discuss in 
§2 and §3 how the mass distribution is generated by a complete statistics in the 
resolution-contrast plane. This comprises as limiting cases both the direct collapses 
from independent Gaussian fluctuations (§4), and pure dynamical aggregations of 
high-contrast condensations (§5). The competition and mixing between these two 
components is computed in §6, with a net outcome depending on the mass range 
and on ambient conditions. This balance is of keen interest because interactions 
are likely to contribute, as noted above, key common features of apparently diverse 
astrophysical phenomena in the nearby and in the distant Universe. 

2. DISORDER AND BRANCHING 

The language of Cayley trees with disorder (see Derrida & Spohn 1988) con- 



veniently describes at the "microscopic" level of density contrasts 5 how new col- 
lapses from the linear perturbations compete or combine with aggregations of the 
existing condensations. 

The tree is a computational structure (visualized by fig. 1) where at each 
step |U random weights Wi are extracted in a cascade following a sequence of links, 
which may randomly branch into two. As the generation number \i increases, the 
progressive product of such random weights Wi will be related to the probability of 
finding a fluctuation of the density field. The tree coordinate \i will be related to 
the mass scale, and the probability will be related to the number of condensations 
per unit mass. The end result at the "macroscopic" level will be a mass distribution 
N(M,t). 

The tree includes in one random process the following two components of the 
6 field: (1) disorder - with increasing resolution S oc a 2 oc M~ 2a the independent 
values taken at a given time by S(M) execute, as recalled in §1, a pure random 
walk; and (2) branching - 5 may also jump by stochastic "branching", actually 
coalescing two paths into one. 

The statistics of the combined process is conveniently derived in terms of the 
partition function computed at each generation \i along the tree in the direction of 
coalescence from an initial [i : Z fJi = J2\ pa ths] ^-i=u Wi > wnere the product refers to 
the i-th preceding serial links of the tree, and the sum includes all paths coalesced 
at \i. The distribution function of Z will be P M (Z), with moments 

<Z*% = JdZP fl (Z)Z k . (2.1) 



It proves technically convenient to set Wi = 1 + Vi, to exhibit the unbiased average 
Wi = 1 when v fluctuates around 0. In this representation the partition function 
reads ^' 

Z » = E n -^ C 1 + "*) • ( 2 - 2 ) 

[paths] 

For the specific tree in fig. 1 the definition of Z implies the following recursion 
relations to hold at each elementary step dfi: 



7 __ J (1 + v) Z/j, with probability 1 — rjdfj,, ,_ „, 

m+ m 1 (1 + v)[Zi fI u\ + Z 2fJ , U2] with probability r\d\i. 

The first line describes pure disorder, and the second includes branching; v and 
u are stochastic variables. Gaussian initial conditions for the perturbation field 



(i) 



For v — > 0, the limit we shall consider, this is indistinguishable from the other 
representation Z^ = Y2\ va ths] e M ° *■ The latter is heuristically attractive because 

one may directly identify yV - Vi with yV Si = 5 and use the relationship 5 = E/Eb with 

the energy E of linear perturbations in a critical universe normalized to the background 

_ v^m E-/E 
potential energy (Peebles 1980). The resulting Z^ = Y2\naths] e ^•=''» has the 

form of a standard partition function, and the counting expressed by the definition (2.2) 
implies a sum rule for the linear energies. Nonlinear superpositions of comparable en- 
ergy fluctuations are explicitly accounted for by Y2\vaths] m ^ ne definition (2.2) and 
corresponding to the relation (2.3b). 



imply for v a distribution which for small v goes into a Gaussian with variance 
D dfi proportional to the step length: 

g (v) = e - v2 / 2Dd ^/(2nDdfi) 1 / 2 . (2.4) 

Another stochastic function r(u), generally far from symmetric, governs the distri- 
bution of the weights u translating, as we shall show, the interaction probabilities. 
A compact way to embody all moments is in terms of the generating function 

G ll (x) = (e- (1+X ^). (2.5) 

Successive derivatives of G at x = are related to moments of Z of increasing 
order, as shown by the formal expansion 

Gv{x) = Y,{-l) k 0- + *) k (Z%/M • (2-6) 

k 

Correspondingly, the evolution of the moments can be embodied in a master 
differential equation equivalent to the recursion relations eq. 2.3. It follows from 
eq. 2.5 that in an interval dfj, the two components simply add with the weights 
provided by their probabilities as given eqs. 2.3, and their superposition gives 

G^+d^x) = (1 - rjdiS) G M + i]dnGl with (2.7) 

G^= / dv g(v) G^(x + v + vx), G^= I dur(u) G^{xu) . 
The continuous limit dfj, — > yields 



d^G^ = D 8 XX GJ2 + v (Gl - G M ) . (2. 



For, in this limit the lhs yields G M +d M — > G^ + d^G^dfi, and on rhs the variance of 
v shrinks proportionally to dfx, so that G^ — > G^ + [D + 0(x)] dfi d xx Gfj,/2. When 
the two sides are set equal, finite terms cancel out, and to the first order in d\i the 
above equation obtains near x — 0, which will be the relevant point. 

It is easily perceived, and is discussed in detail in §3, that in the limit of no 
branching (i.e., rj — ► 0) eq. 2.8 reduces to a pure diffusion equation for G ll (x), 
similar to the equation given by Bond et al. 1991 and recalled in the Introduction. 
The opposite limit of branching with no Gaussian noise yields, as we shall see in 
§5, the Smoluchowski equation for the kinetics of N(M, t) under binary interac- 
tions discussed by CCM. We next substantiate these two limits by examining the 
relationship of G^(x) with N(M, t), and that of the generation number \i with the 
resolution S or with the physical time t. 

We stress that solving eq. 2.8 for G^(x) is equivalent to computing the single 
moments of Z directly from the relations 2.3 and then synthetizing G^(x) from its 
expansion 2.5. The advantage of following this latter route is that the recursion 
relations are very simple, and especially suited for numerical work. On the other 
hand, the master eq. 2.8 is convenient for making contact with previous work in 
limiting cases and for discussing the balance of the two competing modes. The 
statistical effects of this competition are illustrated in fig. 2. 

3. FROM THE CAYLEY TREES TO THE MASS FUNCTION 

We first note that in the limit of no branching (i.e., r\ — ► 0) the remaining 
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terms of eq. 2.8 yield the structure of a diffusion equation 

d^Gp = Dd xx G^/2. (3.1) 

This, by the gauge (structure- preserving) transformation 

djjb = — d InS , dx 2 oc cM 2 .D/S' , (3-2) 

with the condition D dfi > (see eq. 2.4), can be made identical with the equation 
(Bond et al. 1991, Lacey & Cole 1993) 

d s Q = ds 5 Q/2 , (3.3) 

which governs the evolution of Q(S, S) dd, the density of trajectories (random 
walks) of S as the resolution S oc a 2 oc M~ 2a is increased or the scale is decreased. 
Because of its central role in what follows, we discuss the gauge 3.2 in more 
detail. We first integrate eq. 3.2b to 

x oc (5 — 5 c )/a , (3.4a) 

so that x = corresponds to 5 = 5 C ; thus the collapse threshold is embedded in 
the tree formalism as a zero point for the tree variable x. 

Then we specify (see fig. 3 and its caption) the relationship of the remaining 
tree coordinate \i and of its initial value \i with the physical variables M and t. 
At each time t the resolution scale S = M~ 2a corresponding to a given generation 
number \x is reckoned from a minimum S oc [M c (t) / e]~ 2a with e<C 1, correspond- 
ing to a maximum mass M max = M c (t)/e S> M c (t), so that eq. 3.2a is integrated 
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in the form 



1 s 

fJ' — fJ'o = —-In— = ln(e m) a . (3.46) 

2 D n 



'O 



Note that S contains the t-dependence of the initial condition \i for each realiza- 
tion of the tree. The tree coordinates fi Q and \x — \i Q are used as independent ones 
in what follows. The mass scale M is an independent variable in the frame of the 
physical coordinates (the plane M, t in fig. 3), but in terms of the tree coordinates 
becomes a function of m and t through the relation M = m M c (t) (see also caption 
to fig. 3). 

Having so specified the relations between the physical and the tree variables, 
we now elaborate a procedure for computing the mass function from the tree. 
Following the papers by Bond et al. 1991 and Lacey & Cole 1993, the PS selection 
rule in terms of Q(S, S) and N(m) = N(M)dM/dm is written as 

dSQ = ~dS[d s Q/2] Sc • (3.5) 

-00 ■'"c 

The second equality, which expresses the mass fraction as a density in resolution 
of flux across the (absorbing) boundary S c , is formally provided by integration 
over 6 of eq. 3.3. We note that within the above theory a Laplace transformation 
relates N(M) to Q(S, S). In fact, the integral form of the above relation may be 
recovered by applying the operator [ds]s c to both sides of the integral relation 

/ dm N(m)e- rn{s - 5c) = -£- I dS Q/2 . (3.6) 
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In other words, the selection rule and the diffusion equation imply that f dS Q/2 
is the Laplace transform of N(m) M c /p. ( 2 > 

Based on the close similarity of eqs. 3.1 and 3.3 when the gauge eqs. 3.2 are 
considered, we propose the following general form of the selection rule: 

N(m) 



N T 



m dm = —d dxG^(x) = — d/j [9 x G M (x)] = — d/j (Z M e M ). (3.7) 



Here iVV(t) = fN(m)dm oc p/M c , and the derivative is computed at x = 0, 
corresponding to 5 = 5 C after the gauge transformation 3.4a; note from the second 
term that additive components of G independent of \x will not matter. In integral 
form, the fraction of condensed mass is 

dm N(m) m = -N T / d/j [d x G^ , (3.8) 

where the last integral is actually ^-independent (see footnote 2 ). But the above 
relation also obtains by differentiation d x at x = of the relation 

dmN(m)e- mx = N T fdnGJx), (3.9) 



(2) 



In the following, unless otherwise specified, the integrals are meant to run over the 
full range of the variables. These are as follows: 5 E [— oo,+oo], m £ [M m i n , M c /e], 
and fi G [— oo, fx ]. The integrals over \x ranging in the last interval may be rewritten as 
integrals over [i — fi in the corresponding range, and are explicitly t-independent. 
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which expresses the generating function, normalized to Nt, as a Laplace transform 
of the mass distribution. The latter then obtains by anti-transforming G„(x). 

Equivalently, all successive moments of N(m) are obtained by successive 
differentiation of eq. 3.9. For example, the zeroth-order moment is given by 
/ dmN{m) = Nt f (I/aG^O), which fixes the normalization J dfiG^^O) = 1; the 
1st moment is given by eq. 3.8, consistent with the differential form 3.7. 

4. DISORDER: THE PS LIMIT 

We now show how the PS mass distribution may be derived from eq. 3.7 in 
the limit of no branching, that is, rj — > 0. 
The latter equation involves 

[d x G fl ] = (Z,e-^} = J2(-^ k - 1 (Z k ),/(k-l)l . (4.1) 

k=i 

The single moments of Z will be directly derived from the recursion relation eq. 
2.3a (with D = 1) in the form 

'^^ '""'S' 1 ' 4, (42) 

Expanding the binomial around v = to the lowest (second) significant order, 
integration of eq. 4.2 yields 

<^Wm=[i + ^^^]<^%- (4-3) 

In the continuum limit d\i — > this yields 

d ln(Z k ) „ = ^^ dft , (4.4) 
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which using the gauge 3.2a (i.e., dfi = —dlna) integrates to 

(Z k },/(Z k } = (a /a)^. (4.5) 

This relation implies that the first moment of Z is a constant relative to fj,, 
which is a natural consequence of the property w — 1. The next two moments 
yield the leading contributions to the sum 4.1. In fact, following eq. 4.5 and 
keeping only the ^-dependent terms (as noted just after eq. 3.7), we find 



Z 



2 



2 



(Z ll e- z >) = (Z 1-Z+ — + ... ) = ^ 2 - 1-v^+OR • (4.6) 



a 



Z 



ol „/a 5 



2 a 2 Va 5 



Recalling from §3 that a /a = [M/M max ] a = (em) a , it is seen that a /a < 1 
holds for M < M max = M c /e, and the last equality can be resummed to within 
0(e 5a ) to yield 

[d x G,] ^Z 2 ^e- z °^° 2 = Zl{em) a e- z °^ 2a . (4.7) 

The tree by itself, as any statistics, does not specify the dynamics of gravi- 
tational collapses, and provides only scaling behaviors: these, on substituting eq. 
4.7 in eq. 3.7, are 

"<•»> -w-- 2 mk [d ^ " nsr (f m)_2+ ° e ~^ mf '' 2 • (48) 

and turn out to be the same as in the PS distribution. As to the constants which 
do carry dynamical information, the initial condition Z may be set to S c , the 
counterpart here of the boundary condition used by Bond et al. 1991. In addition, 
the rescaling N(em) = N(m)/e holds, and the overall normalization follows from 
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the requirement f dfxG^O) = 1 derived in §3. Thus the full PS expression 1.1 is 
recovered. 

5. BRANCHING: THE SMOLUCHOWSKI LIMIT 

We now consider eq. 2.8 in the opposite limit of branching with no Gaussian 
noise. Then the remaining terms on rhs read 

d„G, = V (Gl-G,). (5.1) 

Equivalently, the change of single moments of Z may be derived directly from the 
recursion relation 2.3b with no noise, to yield 



(r),^ = d» I I dZ 1 dZ 2 P(Z t ) P(Z 2 ) (Z lfI + Z 2fI ) m 

+(1 - dp) J dZ P(Z) Z™ ; (5.2) 

here the coupling parameter r\ has been absorbed into a rescaling of fj, for con- 
venience as will become apparent after eq. 5.11. Expanding the binomial, this 
becomes 



k ryva — h 



(Z m )» +d » = dvj J dZ ld Z 2 P( Zl ) P(Z 2 ) J2 kl{m _ k)l z l Z Z 

+(l-d fJ ,){Z m ) fl . (5.3) 

In the continuous limit d\i — > this yields 

o (z m ), _ y (z% (z m ~ k ), (z^% 

" <"' ^ k\ (m-k)\ m! " l ' ' 



mi 

k=-\ 
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This relation has a structure similar to the Smoluchowski equation which 
governs the kinetics of binary aggregations of definite condensations (CCM): 



ON 1 



-M 



dt 2 



/ dM' K(M', M - M', t) N(M', t) N(M - M', t) 
Jo 

/•OO 

-N(M,t) dM' K(M,M',t)N(M',t) . (5.5) 

Jo 

The kernel K represents the rate of aggregation in a system of condensations with 
relative velocity V and interaction cross section E, and averages to r _1 = NtT,V 
in terms of the component number density iVj^t). 

Indeed, we show next that the above equation may be identified with eq. 5.4 
(or with the equivalent eq. 5.1) when dealing with high-contrast condensations 
insensitive to the perturbation field; that is, in the limit of no disorder (with 
5 = a) and of high-contrast condensations (i.e., 5 C — > 0). For a constant kernel K 
we again proceed directly. 

To this end, we first rewrite the discretized form of eq. 5.5, after dividing by 
Nt/2, in the form 



2 ONm , Nm ^ Nm-m> Nm 1 Nm 

M> 



N% dt ~Nt ~ 2-f N T ~Nt ~Nt ' [ ' } 



where the parameter K has been absorbed into a rescaling of t for convenience 
which again will become apparent after eq. 5.11 below. We then express the 
t-derivative in terms of the tree variable \i. The gauge eq. 3.2a yields d\i = 
— \dlnS = adlnM. In the tree frame (as we discussed in §3) M = mM c (t) oc 
tu/Nt hold, and one obtains 

dN T 1 nT , 
dfi oc — — - = -N T dt , (5.7) 

l\ r p z 
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with the last equality coming from eq. 5.5 integrated over M. Then eq. 5.6 can 

be written as 

d Nm _ v—- Nm-m' Nm' Nm 

Now the above equation exhibits even more clearly a structure similar to eq. 
5.4, and is identical to it provided that 

^ oc <*> (5.9) 

Nt ml 

holds. The simplest way to prove this relationship is to to write the integral 
relation 3.9 with the shorthand 1 + x = w in the form 

fc 

with the signs (— l) fc included into (Z k ) in view of the invariance of eq. 5.4 
relative to such transformations. Then we note that eq. 5.4 for m = 1 implies 
dln(Z) ^ oc d/j,, and hence (Z)^ oc Nt by eq. 5.7. For m = 2 it implies (Z 2 ) oc iVr 
asymptotically when Nt <C iVV holds; similarly for the higher orders. With 
{Z k )^/NT k\ — >const relative to fx, the integration on rhs of eq. 5.10 reduces to 
f dfiNT = J d[ie~^ in view of eq. 5.7, and yields a constant factor. Thus, writing 
the integral on the lhs of eq. 5.10 in discrete form, we find 

J2 N mW m ocJ^^iZ^^/kl , (5.11) 

m k 

where of course the dummy indexes k and m may be identified. We now apply 
to both members the operator [d™] w= o, which in fact is appropriate for high- 
contrast structures with 5 C — ► and 5 ~ a. The result is the relation 5.9, thus 
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completing the identification of the Smoluchowski equation of the form 5.5 with 
the tree recursion relations of the form 5.4. 

The proportionality constant in eq. 5.11 translates into a rescaling by a 
constant factor of the correspondence between the independent variable in eq. 5.4 
(which in full is fxrj), and that in eq. 5.6 (which in full is t/r). The statement that 
^ m (Z m ) M /A^T fn\ becomes time-independent holds for the solutions N(M, t) of the 
Smoluchowski equation in their asymptotic, self-similar stage where N(M)/Nj, is 
time-independent. 

The equivalence of the Smoluchowski equation 5.5 with a Cayley tree can also 
be proved when the kernel K(M, M') is mass-dependent. Then the equivalence 
may be recovered by going through the Laplace transform of eq. 5.5, as has been 
carried out explicitly by Peshanski (1992) for the case of multiplicative kernels 
K(M,M') = K(M) K(M'). The link between kernel and weight distribution on 
the tree r(u) is explicitly given by 

/•OO 

K(m)=C dur(u)e um , (5.12) 

Jo 

which amounts again to a Laplace transformation. The constant C oc r 1 ' 2 contains 

the normalization of K, since r(u) is normalized to 1. 

6. DISORDER AND BRANCHING SUPERPOSED 

We now compute numerically the mass function in conditions where both 
disorder and branching are effective. As has been said, for numerical work it is 
much easier and faster to use directly the recursion relations (eq. 2.3) for the 
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partition function Z^ - rather than the master equation 2.8. 

This is done by the following procedure. At each step \i of the tree a numer- 
ical algorithm extracts the weights v from a Gaussian distribution, and uses the 
first recursion relation 2.3 with probability 1 — rj Afj,, or the second with proba- 
bility rj A//, to construct the partition function at the next step. In the present 
computations we use a delta-function r(u) = S(u), corresponding after eq. 5.12 to 
a constant interaction kernel. In fig. 2 we have shown the resulting trajectories of 
the contrast for increasing S, which are related to the tree variables x and \i by 
the gauge eqs. 3.4. 

For each tree, we generate the entire Z^ and its moments. The procedure is 
repeated for a large number of trees (up to 10 3 ). The moments of the partition 
function are used to compute 

[d x G ll ] = Y l (-^ k - 1 (Zi)/(k-l)\, (6.1) 

fc=i 

where the sum we actually used runs up to k = 10. Then at any given t we 
compute the mass distribution N(M, t) following eq. 3.7, and check that the total 
mass is conserved in time to within 1% when 10 3 trees are used. 

The result depends on the branching probability rj, which acts like an effective 
coupling constant for binary interactions. For small rj ^ 10 -2 the diffusion part of 
eq. 2.8 dominates and one recovers the PS mass distribution. For increasing values 
of r\ the resulting distributions become steeper, while the cutoff at large masses 
tends to straighten up. When rj ~ 10 _1 the distribution reaches its asymptotic 
form, a pure power law with logarithmic slope close to —2; see fig. 4 
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There are two reasons why even a small value of rj is effective. First, note 
that on transforming eq. 2.8 following the gauge eqs. 3.4 the effective coupling 
parameter is rj/S oc r)m 2a , that is, a coupling more effective at larger masses. 
Second, a kernel constant in time, as considered here, maximizes the interactions 
because it implies no dependence of the number density on ambient evolution. 
This is physically realistic for interactions within large scale structures surviving 
for longer then r. 

These results compare interestingly with the results from cosmological N-body 
simulations with a large dynamical range and highly resolved data analysis, like 
those performed by Brainerd & Villumsen 1992 and Jain & Bertschinger 1994. 
These papers agree in finding a slower evolution and an excess at large masses 
compared with the PS mass distribution, with the former finding consistency with 
a simple power law distribution, approximatively M~ 2 within the walls and fila- 
ments. These features are consistent with our findings. 

In terms of overdensities vs. resolution as used in fig. 2, the interactions cause 
not only an obvious density decrement of the trajectory distribution toward larger 
coalesced masses, but also a skewness relative to the Gaussian counterpart. 

7. DISCUSSION 

The existing theories for the shape and the evolution of the mass distribution 
N(M, t) fail to capture the full complexity of cosmic structures. The PS the- 
ory, with its recent elaborations, provides with eq. 1.1 the best quantification for 
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the scenario envisaging hierarchical collapses from initial density perturbations. 
Yet the result differs as to amplitude, shape, and evolution from data or from 
simulations and from realistic excursion set computations with the same input 
parameters. The alternative mode to hierarchically building up structure is based 
on binary aggregations between high-contrast condensations. This predicts non- 
Gaussian formation of rare large objects, and describes more closely the erasure 
of substructures within a structure by resolving timescales different from its dy- 
namical time td- But it requires input, at least initially, of formed condensations 
into an environment protected from the Hubble expansion (CCM). 

We submit that both these modes constitute only partial representations of 
the evolution of the density field. They select either purely Gaussian random walks 
of the linear density contrasts as functions of the resolution, up to crossing the 
threshold of nonlinearity; or trajectories "branching" stochastically only at large 
values of the contrast. Correspondingly, the PS function 1.1 and the Smoluchowski 
aggregation equation 5.5 correspond to restricted, "macroscopic" averages from 
a complete " microscopic" field statistics which treats aggregations and direct 
collapses as proceeding together at similar contrast levels. 

We propose such a complete statistics which combines, in the form of a Cayley 
tree (or random cascade) as represented in fig. 1, random walk and stochastic 
branching of fluctuations into one partition function governed by the single master 
equation 2.8. We also propose the Laplace transform relationship eq. 3.8 between 
the tree generating function G^(x) and the mass distribution N(M,t). 
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We have proved that our proposals indeed yield in the appropriate limits 
both the diffusion equation (§4) with the associated PS function eq. 1.1, and the 
Smoluchowski equation (§5). The former limit applies to Gaussian fluctuations 
that independently reach the nominal threshold for collapse and virialization. The 
latter limit is derived with no formal recourse to a threshold, and describes binary 
interactions of an initially given set of already formed condensations. Pleasingly, 
from averaging over the tree in the latter limit one obtains a mean-field, kinetic 
equation with different solutions depending on the kernel, and evolving away from 
the initial form; in the former limit, one instead obtains a single PS function. 

Both limiting processes are hierarchical and imply "merging", i.e., inclusion 
of smaller condensations into larger ones. But pure DHC in fact envisages only 
reshuffling of (generally) many condensations which belong to lower hierarchical 
levels into a higher level on a larger mass scale at a subsequent time, strictly 
following the conditions set ab initio in the linear fluctuation field. The natural 
quantification of this process is provided by the conditional probability that a 
trajectory up-crossing the threshold at the resolution £2 had a previous up-crossing 
at Si > S 2 (see Bond et al 1991; Bower 1991; Lacey & Cole 1993). 

Pure aggregation, on the other hand, envisages pairs of condensations coa- 
lescing into a third, at a similar - and high - contrast level. Here the nonlinear 
interactions are stochastically set by ambient density and relative velocities, and 
may be triggered at any time when larger scale structures outline volumes with 
expansion slowed down relative to the general "field" . 
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The Cayley tree approach not only proves successful in deriving the two, op- 
positely extreme modes for hierarchically building up structures, but also provides 
a number of links between them: the common tree algorithm or the master equa- 
tion 2.8; the Laplace transform relationship equation 3.8 of G^(x) with N(M,t), 
which extends the PS selection rule to interacting fluctuations; and the tree vari- 
able \i that includes both the resolution S (used to derive the PS function) and 
the physical time t (used to derive the Smoluchowski equation) , as stressed below. 

In fact, we have seen that d(i = —^dlnS holds, appropriate to the former 
case. On the other hand, from the relation M = mM c (t) oc m/Nx one obtains 
dtfJ, oc —dlnNr/dt, which is the eq. 5.7 used in deriving the Smoluchowski equa- 
tion. The constant factors (and specifically the spectral index n, the last remnant 
of the initial perturbation spectrum) are actually irrelevant in the subsequent 
identification of eq. 5.5 with the tree recursion relation in the absence of Gaussian 
noise, as expected. 

Above all, the tree formalism describes with equal ease the mixing and com- 
petition of the two pure modes; that is, it describes condensations interacting and 
aggregating while still growing or collapsing. The process may be computed ei- 
ther from the master equation, or directly in terms of the tree partition function, 
which is very convenient and fast on computers. These computations yield, when 
branching is fully effective, N(M, t) in the form of a steep, slowly lowering power 
law with logarithmic slope ~ —2. The result is due to the continuous input from 
the Gaussian noise over many scales, combined with the increasing effect of the 

24 



branching mode at larger masses. 

In fact, even a value of rj < 1 is effective because the relevant coupling param- 
eter rj/S oc rj m 2a increases at higher masses. Actually, the interactions are maxi- 
mized by a kernel constant in time as considered here, corresponding to densities 
and velocities unaffected by expansion. This is physically realistic for interactions 
taking place within large-scale structures which survive for times longer that r. 
In these conditions high-mass condensations form faster than the Gaussian rate 
within large scale structures, not unlike the formation action seen in large N-body 
simulations (Brainerd & Villumsen 1992; Babul & Katz 1992). Observations show 
that groups are concentrated in filaments and sheets outlined by redshift surveys 
of galaxies (Ramella et al. 1990) and that the mass distribution over scales from 
10 13 Mq to several 10 15 M© is consistent with a steep power law (Giuricin et al. 
1993). 

We shall discuss elsewhere the relationship of our approach with the dynamical 
descriptions of matter field under gravity - e.g., the adhesion model (see Shandarin 
& Zel'dovich 1989) or the frozen-flow approximation (Matarrese et al. 1992) - 
which include shear effects in addition to interactions of comparable fluctuations. 
Conditions of aggregating interactions protected from the Hubble expansion within 
larger structures are likely to underlie apparently diverse astrophysical phenomena, 
from formation of one large cD-like galaxy by strong interactions of members in 
groups, to the very formation and growth of groups and clusters within large scale 
filaments and sheets. Toward such a complex emergence of cosmic structure Cayley 
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trees provide a unifying approach. 
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FIGURE CAPTIONS 

Fig. 1. A schematic representation of a Cay ley tree. The random process is 
represented as a cascade proceeding from the bottom up. At each step (labeled by 
the generation number \i + dfj.) a random weight w = 1 + v is extracted according 
to a Gaussian distribution g(v). In addition, the path may either result from 
coalescing (with probability ndfi) of two branches that were still distinct at fi, 
or proceed along a single branch. The partition function at a point \i + dfi is 
constructed by summing the products of the weights over all paths leading to it. 
The heavy line marks a specific path on the tree with weights w\, W2, ... W7. 

Fig. 2. Contrast vs. resolution (in arbitrary units) of a number of trajectories 
computed from numerical realizations of the tree (details are given in §6). Top 
panel: Pure Gaussian noise resulting in random walks with dispersion increasing 
with increasing resolution (or decreasing mass) . Bottom panel: Effects of including 
random branching (with r\ = 0.1) in the statistics of the trajectories; the branching 
points are marked by a dot. The study of the apparent differences between the 
two panels constitutes the thrust of this paper. 

Fig. 3. Relations between the physical variables M and t and the tree coordinates 
\x and \x . At a given t (top panel), the initial value S for the resolution corre- 
sponds to a maximum mass scale M max = M c (t)/e ^> M c (t). It also corresponds 
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to a value of the tree generation number \x oc lnM max following eq. 3.2. So 
\x — Ho oc Inm holds, with m oc M/M c (t). For a given mass M (bottom panel), dif- 
ferent values of t correspond to various starting points fx , and to different values 
of fi visualized by the intercepts of a horizontal line for increasing t. 

Fig. 4. Mass function N(M) resulting from numerical realizations of trees with 
both branching and disorder. The partition function and the related moments are 
computed from the eq. 2.3. Dotted curve: rj = 0; dashed curve: rj = 0.05; solid 
curve: rj = 0.1. The first is identical with the Press & Schechter function, eq. 1.1. 
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